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REMARKS 



The Office Action mailed December 4. 2003 has been carefully reviewed and the 
foregoing amendments axe made in response thereto. In view of the amendments and the 
following remarks. Applicants respectfully inquest reconsideration and reexamination of this 
application and the timely allowance of the pending claim. 

The specification has been amended herein to update references to patent applications 

that have since issued as US patents. 

Claims 1, 2, and 7 have been amended to more clearly claim the invention. Claim 1 has 
been amended to claim a set of tag probes selected from SEQ ID NO: 1-2000. This amendment 
is supported, for example, by figure 2 where SEQ ID NO: 1 is identified as a "tag probe" and its 
complement is identified as a 'tag". Claim 1 has been further amended to add the limitation that 
the set comprises at least 1000 sequences from SEQ ID NO: 1-2000. This amendment is 
supported by original claim 4. Claim 2 has been similariy amended to claim a set of at least 
1000 -tas-" which are the complements of SEQ ID NO; 1-2000. Claim 7 has been amended to 

be dependent on claim 1. 

Claims 15-19 are newly added. Support for claim 15 may be found in originally filed 
claim 7. Support for claims 16-19 may be found in figure 2 and on page 8 last line and page 9, 
lines 1-4. Claims 3-6 and claims 8-14 have been canceled herein. 



Rejection of Claims 1-12 Under 35 VSC 101 

Claims 1-12 have been rejected by the Examiner as lacking patentable utility. Applicants 
respectfully disagree. The specific set of tags and tag probes claimed have both a well 
established utility and a substantial and specific utility disclosed in the specification. The set of 
sequences are useful as tag probes and their complements are useful as tags that hybridize to the 
tag probes. The set of tag probes, particularly when attached to a solid support so that each tag 
pi^be sequence is in a fcamre of known location, is useful to detect sequences that have been 
labeled with the complementary tag. The tags may be used to label, for example, primers which 
may be used for a variety of purposes, such as Single Base Extension (SEE). The tagged primers 
can be labeled in an assay specific manner and detected by hybridization to the array of tag 
probes. This utility is specific, asserted in the specification and well established. 
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I. ^gards to the Exan^ner's observation tha. the sequences lac. 100% speciftcity to any 

■ ■ r^nBank this is accurate, but this is a feature of the utility of the 
particular organism m GenBank, ttus is accm , 

e^u^ces. T.e se,ue„ce. w.e select, so .h. ..ey are „o. co„>p._ - »^ 

peciflcadon page S, line, 23-24.) What is daimed is a set o, tags and tag ptob^. The 
se^ences ate selected so .ha, they hybddize -nder similar conditions to a cotrespondtng ag 
p„be and so .ha, tags in the set hybridize only to theix cotresponding tag p^be and no, to other 
Lg probes i„ d,e set. This allows the Ugged sequence to be specifically de,cc«d by 

J- t«<T T^rnVv Anv eenomic sequence can be tagged and then 
hybridization to its corresponding tag probe. Any genomi 4 

detected on an array of tag probes. T^c array of tag probes can be used to detect any tagged 
sequences and a researcher therefore has the flcxibihty to use the same atray design to detect 

, u Tht^ same arrav can be used for many different 

virtually any sequence that can be tagged. The same array 

experiments. See Exhibit 1 "GeneChip GenFlex Tag Array" Product insert. 

SEQ ID NO- 1-2050 were selected from a list of randpm sequence 20 mer sequences. 
Member, of the set were selected so that each member of the set hybridizes to its complementary 
tag probe (the perfect complement of the tag over the 20 nucleotide length of the probe) ^.Uh 
Closely matched melting temperatures. Members of the set were also selected so that they do not 
cross hybridize to another member of the set or to sequences in the public database. 

At least one specific utility is asserted in the specification. In ti.e specificauon on page 
10. lines 15-22 one use of tagged primers and an array of tag probes for genotyping is disclosed 
as follows (emphasis added): 

polynucleosis . the po^o^h,c^~^^^^^ ^^^^ 

amount or each aisunci laoei i^axi uc i. j- *• ^i. r-ft^rf^^P'nts a 

S tag represents a ^ P— ^^^^^^^^^ 

t:^^^!^':iXf T:^^^^^ JUTL^etei^nation of allele 
fr?q^^Sn a popJanon'^ Another embodiment employs just steps (2) and (3). 

In this embodiment, SEQ ID NO: 1-2050 are present on an array as tag probes and the 
complements of SEQ ID NO; 1-2000 are used as the tag portion of the sequence tagged pnmers. 
For each polymorphic locus to be analyzed a sequence-tagged primer with a unique tag ,s used. 
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T^. sequence tagged primed also have a 3' region th.t hybridizes inunediately upstr.^ of the 
polymorphic locus. The ^uence tagged primers are hybridized to the ampUfied fragmer^ts 
cont^ning the polymorphic loci a.d they are extended by a single base eorresponding to the 
polymorphic base. Each species of dNTP present in the reaction is diffet^tially labeled. The 

. T^rim^r*: are then hybridized to the array of tag probes and the 
labeled sequence-tagged pnmers are men nyun^ 

^u^^ TV,*. MO nrobes are oresent at known locations so the genotype 
hybridization pattern is analyzed. The tag proocs arc pi^^^cxi 

u- ,11^1^ Tr.«v determined by determining which label or labels have 
of each polymorphic allele may tw aetemiincu o 

-.-x:^™ to<7 TiTr.he This method was Used in Fan et al. a copy of which 
hybridized to the correspondmg tag prooe. inis jnemuLi w 

has been provided with this response. 

The utihty of the claimed invention is also well established. Exhibit 3 includes a list of 
publications describing studies that have used the GenFlex tag array or sequences from SEQ ID 
NO- 1-2000 The prior art, for example. Shoemaker et al. Nat. Genet. 14:450-456 (1996), a copy 
of which has been provided with this response, demonstrates the well established utility of sets of 
20 base tag sequences that can be detected by hybridization to a tag array and that can be used as 
unique identifiers. 

Finally, the Examiner indicates that there is nothing specific to the 2050 sequences that 
distin<mishes them from a different set of 2050 sequences or from any set of 2010 unrelated 
seqoelces. Applicants respectfully disagree. This set of 2050 sequences wa. carefully selected 
to match a selected set of criteria. They are all the same length, they were not prescm m the 
GenBank database, they have closely matched melting temperatures so that they are capable of 
specifically hybridizing to their corresponding target sequence under the same hybridization 
conditions, allowing the hybridization of all 2050 probes to be done in parallel in the same 
reaction and no two sequences in the 2050 sequences will hybridize to the same target under the 
selected hybridization conditions. There is an almost infinite number of sequences that would 
not match each of these criteria and would therefore not be suitable for use as tag-tag probe 
sequences. SEQ ID NO: 1-2050 were further selected from a group of 2200 sequences that met 
the criteria described above, based on the ability of those 2050 to discriminate between the 
perfect match and mismatch probes. For additional details on how SEQ ID NO: 1-2050 were 
selected see Exhibit 2, GeneChip GenFlex Tag Array Technical Note No. 1. 
Applicants respectfully request withdrawal of the rejection. 
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Rejection 0f Claims 1-12 Under 35 USC 112 

Claims 1-12 have been rejected as noi complying with the enablement requirement. The 
ExaMner asserts that the claimed subject matter was not described in such a way as to enable 
one skilled in the art to make and or use the invention. Applicants respectfully disagree. The 
invention as claimed is a set of sequences that can be used as tag probes and their complements 
(ta-s). Each member of the set hybridizes specifically to its complemem under sirmlar 
conditions and does not cross hybridize with the complement of any other member of the set 
under the selected hybridization conditions. The members of the set also do not hybridize to 
sequences present in the public databases under stringent hybridization conditions- The set has 
also been empirically tested so that the members of the set are known to discriminate between 
the perfect match probe and the mismatch probe with high efficiency. 

The specification provides at least two ways that the tags and tag probes may be used. 
First they may be used to tag individual deletion mutations. This has been used in yeast to 
identify open reading frames that are essential for growth under selected conditions. Briefly, 
each strain has a single deletion, each deletion is tagged with a different tag, the different strains 
are grown in the same culture in the selected conditions. The nucleic acid is isolated and 
analyzed using an aixay of tag probes. The tags that are present indicate deletions that are 
tolerated under the selected conditions and the tags that are absent indicate deletions that axe not 
tolerated under the selected conditions. Second the tags may be used to label primers for locus 
specific genotyping as described above. Both of these uses are described on page 10 of the 
specification. Applicants respectfully request withdrawal of the rejection. 



-9 - 



PAGE 1 0/33 • RCVD AT 3/4/2004 8:30:31 PM [Eastern Standard Time] * SVR:USPTO-EFXRF-1/0 * DNIS:8729306 * CSID:4087315392 * DURATION (mni-ss):1440 



Mar-04-04 06:32piii Froni-Af fymetrix, Inc. 



408 731 5392 



T-964 p. 0 11/033 F- 



ATTORNEY DOCKET NO.: 3108.1 
Application Serial No. 09/827,383 



CONCLUSION 

In view Of the foregoing amendments and remarks. Applicants believe all pending claims 
axe now in condition for allowance and should be passed to issue. If the Examiner feels that a 
telephonic interview would in any way expedite the prosecution and allowance of this 
appUcation, please do not hesitate to call the undersigned at (408) 731-5768. The Commissioner 
is hereby authorized to charge any additional fees which may be required, or credit any 
oveipayment to Deposit Account 01-0431. 

Dated; ]^-rah 4. 2004 Respectfully submitted, 

'Sandra E. Wells 
Reg. No. 52,349 

Affymetiix, Inc. 

3380 Central Expressway 

Saxita Clara, CA 95051 

Tel: 408-731-5000 

Fax:408-731-5392 
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EXHIBIT 1 



GeneChip^ 
GenFlex™ Ta 




Array 




Flexible GeneChip® 
Array 

'The GenFlcx™ Tag Array (P/N 90O3O2) 
is the first in a powerful new line of 
flexible GcrteChip* probe arrays from 
Affymctrix. The GcnFlex™ Tag Airxy 
uses nutlcic acid hybridization zo 
pArse as many as 2000 individual 
reacdon products from complex 
pooled mixrureS- You can now creace 
and iinplement hybridiza^ion^baSed 
assays of your own design. 

Unlimited Applications 

The GenFlwc™ Tag Array is virtually 
ujilimiced in ics potcTicial ro supporc 
new apphcd-tions. Any reactiOrt in 
which a. nucleic add rag can be 
incorporated into the products may be 
amenable co analysis with chc array. 
GeneChip* probe arriys are ideally 
suited for separadng complex 
mixrures, enabling highly multiplexed 
assay srrategies. 

Examples of appiicacions considered 
include the following; 

• Analysis of mutant strains of 
Sdccbaroinyces cer^visutA 

• Genocyping of single nucleodde 
polymorphisms (SNPs)^ 




Figure 1 . The GenFlex™ Tag Array Approach. A cypicil assay is d<jpicc«i in whidi 
chc Tiig sequence (colored 5«gi«4ncs) have been incorpofaxed inro a S«c of reaction ppoductsr In 
chis cxampJ*, a sec of nucleic acid sequences h«iv« been inrcrrogaced by a biocWcmical rta^don 
rejulcing in the addirion of a nucleotide (A,C,G, orT) to each Tag-concaining molecule. The 
nucleotide is labeled widi one of fwo difFcrcnt fluorophorw (red and yellow circles). The assay may 
be carried ouc in multiples: rcacdonS which arc then combined inco a smgU pool. The tidaiih of 
each individual reiunion (i.e., which lab^jled base y/as incorporated) can be deC*rmincd by 
hybridiCTCicin co che GenFlcx'^" Tag An-ay Th« Tag-probe (itippled colored scgmcnK) on chc 
GenFlcx™ T?.g Array capCu«s che corrMpoading r<Mu:cioi\ product by hybridizing co die Tag 
scqwsrtce. The b/bridizanon signal dctccC«J for fiacb of the 2000 Tags on the GcnFlex^ Tag 
Array reveals uhc rtisolcs ofe^ich irtdividiinl reaccion. 



Aflymetrix has partnered wjch Orchid 
Biosciences, Inc. to develop SNP-IT™ 
assaySj analogous to che single base 
extension strategy employed by Fan etal^ 
for SNP genoxyping for use with che 
GcnFlex"^ Tag Array 



Note: A license is required Jrom 
Orchid BioSd^ces, Inc. to ^^^nducr sin^ boi'e 
extension assays, called SNP-VT^ assnyTf wkh 
CencChip^ CtnTicx''" Arrays. 
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50 micron 



Lm«»»r«nB^«ro=^ly ^^^Z^M'^^^^*^^^^ probe AflyinW'. 
pcifecc compUment ~ che T>B, « .'^^'^f^* '^^?^e.p„bc widTa different bwe !ir pos'""" 

rion pmduct. hybridi*i=s m^C Strongly ^ T^g-?^^ 

fluoropHore is incorporaced into rhe 
Tag-concaining molecule. The neamon 
products, up CO 2000, are pooUd and 
hybridized to che GenFleJt T^g Amy- 
The resales of each individual reaction 
can then be f^$ctm&d by die hybridization 
signal intensicy detecced for the 
corresponding Tag-probe sets. 

Fifty Quorescein-labeled Tag 
oligotiucleocides are included and 
should be U5ed as hybridiajaxion 
controls wirfi every array. 



The Tag Array Strategy 

The GenFlttX^ Tag Array is comprised 
of capture probes for 2000 Tag 
5cquence$. These are 20m«s which were 
seLecced from ^ possible 20mcrs to 
have similar hybridization characterisdcs 
and minimal homology to sequences 
in the public database^. The selection 
and characterization of the Tag 
sequences is further described in the 
Technical Note (P/N 700428) and to 
facilitate assay design the Technica 
Nore and the Tag sequences are 
available on a CD-ROM (P/N 610026). 

Any assay in which nucleic acid Tag 
sequences can be incorporated is a 
candid;*ce applicadon. Tag' refers to 
the sequence that Is being captured by 
the array. The Tag-probe sec refers co 
the four sequcpt^ts on the array used to 
analyze a given Tag, one of which is the 
perfect complement of the Tag (figure 2), 

Typically, the Tag sequence is 
covalcndy attached to a molecule of 
incerest, for example a DNA primer. 
The linked moU*^ule then is the target 
of a biochemical reacrion in which a 



Arr^y size 



mtm 



Oligo length 



20mecs 



Control sc^iienccs 
included 

Hybridixaiion ccricrols 



GcnflcK™'I^ArT^iy 




Benefits 

* Flexible - f>esign your own assays 
with a standard array. 

• HighlyparaUel- Up to 2000 reactions 
analyzed in a. single experiment. 

- Cost-effective -The array is 
compccidvely priced and supports 
cost-reducing multiplex strategies 
to enable effective high- throughput 
assays. 

• Proven platform - The GeneCliip" 
system feprcs(=ncs the scate of the arc 
In accurate, reproducible microartay 
analysis. 
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TECHNICAL NOTE No. 1 



■> /-v.,:: 



'*-' .' . ■ * 'V, 



' 1 



JET 



en^jWes tfce interrogation of up vo 
2000 nucleic acid reaction products 
and SO control oligonucleotides. 
This document describes the selection 
and evaluation of the Tag-probe 
sequences included on the array ^ 
Here Tag' refers to the sequence 
that is being captured by the array, 
le., a sequence t^at is attached to 
a, reaction product of interest. The 
'Tag-probe^ refers to the sequence 
on Hje army, the complement of the 
Tag, Thi^, the reaction product into 
which the Tag has been incorporated 
will hybridize to the corresponding 
Tag-probe on the array ^ The Tag 
sequences to be used in primer 
desim are provided free of charge 
on a CD-ROM (P/N 61 0026)^ 



r ■-■Tf- 




mismacch-com^aining Tag-probe 
wich a different base at posinon 
10 h referred co as "MM" in cbe 
HAW. Two addidonal control 
probes are induded on doe artay and 
reporced in die HAW, the ^^CPM" 
and ''CMM " These probes arc 
designed to be the complements 
to the PM and MM Tag-probes, 
respectively. 



Miction 



Four different sequences are tiled 
on the array to interrogate each 
Tag (see figure 1, Tiling strategy). 
The Affymetrix* Microarray Suite 
sofcwarc output, the Hybridizadon 
Analysis Window (HAW), reports 
the background subtracted signal 
incensides for each of diese feaiures. 
Tlie probe designed co be the perfect 
match complexxienu to the Tag, 
(i.e.. Tag-probe) is referred co as 
"PM" in the HAW of Aff)^metrix* 
Microarray Suite software. The 
probe designed to be the 




An initial set of iOmet Tag-probe 
sequences-was selected with closely 



matched melting temperatures. 
A further filter/based on Asymetrix 
expression rules and probe array 
experience, was appHcd to optimize 
and standardize the hybridization 
characteristics of the sec. Finally, 
Tag-probe sequences were 
removed if they were identical or 
nearly identical to each other, to 
control sequences used on the 
array borders, or to sequences in 
the public databases at the time 
of array design. The GenFW 
Tag Array was designed with 2200 
20mer sequences. 




Figure!. GcnFle*^ Tag Array Tiling Strategy 

Accual probe features from a GcnFlex-T:.g Array are -ho^vT^ Tl^^Y 

Tag-probes, stamed with streptavidin-phyco.rythnn, and 'J^^^^'^^^^ 

, u T ^>.,^«*r T7oLir features orcamzedvercically on the prODe array, 

^^-^ ^^^^ ,he c.^). For cch of the four 

''^"hoirirlnl" hori.o«tmy across chc probe o^.y. d.e brighresc 
^£^ZoT:X^s sec. with ^h. "PM- fearur.. This ^--^ "^-P;-^ 
of d,e Tag-prcbf oligonuckodde, che perfect complement to the Tag. Scxnc 
hybridisation is also seen ^o th. MM fesvn.r<: (note T-s 4J. 
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The Tag-probe sequenceSj in a 
3' -5' oriencarion, arc provided in 
Che GeneChip« Hybridizadon 
Analysis Window (see Tigure 5) 
for data tTacking purposes oj:ily- 



A 



mm 



GeoFtexT- 2050 Tags; Relative Intensity Distribution 



lEcOO Prttoe lf(*^ 

ta 50 Cofitral Tags 





s « s « s I * 

Relative Ln{lntenaity] [Ave, n=2] 



The hybridizadon performance 
of the entire sec of 2200 Tag 
sequences was cvaluaced- 
Biodnylatcd oligonudeocides 
complementary CO the PM 
features, ajid thus called 'Tags% 
were synthesized and hybridized 
to the GenFle^"' Tag Array. As 
shown in figure 2, the normalised 
natural logarithm of the signal 
intensities obtained are distribnred 
about a geometric mean of 0.8 
with a standard deviation less 
than 0.1. As showii in figure 3, 
chc Tag hybridizations yield high 
PM/MM ratios (a measure of the 
spedfidty of the Tag hybridization) 
with greater than 98% generating 
a PM/MM ratio greater than 3/1. 
Figure 4 shows a plot of the 
discriininarion score and the 
signal intensity for all 2200 
sequences. A line was fitted to 
select the 2050 sequences with 
the highest discrimination AND 
signal intensity (i.e., those above 
and to the right of the fitted Unc). 
Fifty individual "concror 
sequences were selected from the 
2050 sequences to be 
representative of the population 
with respect to observed signal 
intensities and discrimination 
(see figure 2 & 3). Tag sequences 
with relatively low signals are 
over-rcpresenced in the 50 control 
sequences so as to increase 
information about the sensitivity 
of experiments at the lower limic 
of detection. Fifty fliiorescein- 
labeled oligonucleotides or Tags, 
(Le., complementary to the control 
Tag-probes on the array) are 
included with the GcnFlex^ Tag 

Array Kit (P/N 900302) and should ' .^'rt„ of G^lex " Tags and controls 

be used as internal standards wich Figure 3. Signal d.scrmunat.on of G^lex g 
every hybtidizaticn experiment. ^^^^ daca presented in Bgare 2, .he PM/MM i^tios are plowed. 

Grcacec chan 98% of th^ Tags gencrace a raoo of grearei than 3/1. 
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Figure 2. Signal i^nsiti^^ of GenFlex- Tags and controls 



Signal incen5icies were collected and averaged from cwo independenc 
!v£rentT^^ the 2000 biotinylatcd Tag cUgonucleoades or che 50 ^ 

expenmencs in wmun ^vx^. ^^„M/r.r^Hes were hybridized co the GenFIex 

fluorescein-labeled control ohgonucleot des were nyonoi^ 
tL Array. The frequency of results are shown aS normab^ed (to .cale ot 1, 
L bi^of 0.05) natural logatichms of the nec signal incensiaes. 



GenFtex^'^ 2050 Tags; Discrimination Distribution 
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Figure 4. Selection of Tags 



The nacural logarichm of the 
average signal incensity And 
the discrimina-rion score 
[(PM-MM)/CPM+MM)1 is 
ploixed for each of the 2200 
Tags. A line was fitted to 
select the 2050 T^xgs with che 
highest signal intensities and 
highest discrimination 
scores. 




Pi^^e S. THe H>.Hai..^ A^y.i. Window. ^^^^ 

The baclcground-sub^acted signal mtensiaes '^^J'^'^XyZt^Lcd Rom "PM, MM, CPU and CMM^ v=dues^ 
each Tag. 'BG' refers co .he background '^f^^^'^t^Zl^^l'Xslo and 570 .m. Th. scquex^ces sho^ in .he HAW 
Dsra are collecced and reported for each of two different ^^^^^^"S^':' 3, 5. orieii.adon. 
are those of the oUgonucleotides synthesized on the probe ^xf^y and ^ gwen in the i t 
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..... ^ . . 

t^chnolog^. ^odiicts ^nd services 
are inark^Jted sur>d s^^^ 
incemationaily. For additional 
technical and pricing ^ 
informatiqn about thc'HuSNP 
mappii^g assay and other 
GcneChip^ probe arra)^ products, 
or to find the name and locadon 
of the distributor ojr^ales office 
nearest yo\a, please, contact us at 
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Exhibit 3 Ust of Publications Using GenFlcx tag array 



1. 



2. 



4. 



f ««„fii^^ taff arrav and pyrosequencing in SNP genotyping. 
Srrc " rL'l^arSSular D^agnoLs 5. 243-9, 2003. 

ph^owic analysis of sporulatLon and postg^rminanon grovvth m 

^^^Zr^^^tT^^r.^ of the National Acadeo^y of S^enccs of the 

I^h^^^^^^ -.H.ae„sUy oU.o.ucleoUde tag 

F^'^y's flZ Genome Research 10(6), 853-60. 2000. 

S^lLtt? ^Jhole-genome technologies reveal the ceUular response to 

PSSTsTAt^iSi"^^^^^^ the National Academy of Sciences of the United 

States of America 99(3). 1461-6, 2002. 

5. Functional prormng of the Sacrharo.nyc« 

Oij^ftver G etal Nature 418(6896). 387-91, 20O2. ^ ,^^ta j 

6. ^iT^^ly ^^l^T^^^ to the UV- and MMS-induced DNA damage 

'f^f Proceedings of the National Academy of Sciences of the United States 
nf America 99(16), 10605-10, 2002. 

7. compaiiin of GeneChip and SAGE on the quantitative accuracy m 

transcript profiling analysis. 

T^hii M efai. Genomics 68(2). 136-43. 2000. 

ismi, ivi. ei CM. vjv- V y r»xr A camnl^ bv a four-colour microarray 

8. MulUplex SNP genotypuig m pooled DNA samples oy a lour 

?^n!S^s K «aZ Nucleic Acids Reseaich 30(14). E70, 2002. 

9. Chr^osom:.^^^^ of haplotype blocks and the role of recombination 

S etal Nature Genetics 33(3), 382-7, 2003. 
10 I^S^ce enaSsiV of microarrays appUed to the ionizmg r-d^^f- -^JJ^^ 
Tu^J V. G. J. Proceedings of the National Academy of Sciences of the United 
States of America 98(9). 5116-21. 2001. 
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Parallel Genotyping of Human SNPs Usmg Generic 
High-density Oligonucleotide Tag Arrays 

o- c „ 1 i x.aooionn Cher ^ Marc K. Halushka.^ Anthony Berno,i 

H%;"= Ryde;; Robert |. Llpshutz/^^ David ,. LoCha./^^ 

and Aravlnda Chakr-avart^ ^^^^^ ^^^^^^ ^^^^^ ^ 

,^pUf,«rions Of genomic rtglons """'"l^^^.^^/^^^^^riJ " ^'^^ *P*"^^ ^NP loa 

Single base extension [SBE) reactions using ,rray. The SBE primers, terminating one 

and 5' complementarity to sp«:.fic P™^' ^[.",?;frJ^n7e oflabded dldeoxy NTPs, using . different label 
base before the polymorphic site, j*^"/'^. ^ *^ thTt^^ array. Third, genotypes are deduced from the 
for each of the two SNP alleles, and hybndized l^JJ^^^^^r^^L^S^ of multiplexed sample preparation, 
fluorescence intensity ratio of 'J^^^'^J^^^^^^"^^ Test thk me*od by genotyping 44 Individuals for 
hybridization, and analysis at each ^f^' ^l^^f""^^^^^,^^ Because the hybridization results are 

:^anS:^?h^ ^n.^.. . samp.es. 



The Human Genome Project and other private efforts 
are producing large amounts of genome sequence and 
pcl^orphism data that wiU provide scientists T^th an 
unprecedented opportunity to probe the ^t^^^"')^ 
function of the human genome (Collins et al- 1998). In 
the realm of human disease, these genomic resources 
will allow the dissection of th« genetic components 
and molecular mechanisms of co^^P'^' 'j^^" "^.^ 
eases and traits. Identification of complex disease 
genes will require both linkage and association analy- 
ses of thousands of polymorphisms across the human 
genome in thousands of individuals (Bisch and vi^" 
Lngas 1996; Collins etal. 1997; Ct«kiavarti 1999). To 
enable such large-scale polymorphism analysis in hu- 
man studies, parallel and efficient genotyping n»«hods 
are critically needed. The most common vanant in the 
human genome is the single nucleotide polymorphism 
(SNP) (Wang et al. 1998; Cargill et al- 1999; Halushka et 
al 1999). Homogenous and micro array-based min- 
isequencing has been used to genotype SNPs in human 
populations (Syvanen et al- 1990; Kuppusw^y et =a. 
1991- Chen and Kwok 1997; Pastinen et al. 1997,1998, 
Svvaken 1998). We present a parallel genotyping 
inethod for SNPs, termed TAG^SBE, which analyzes al- 

SHw^onti. RcJeanA Foundctlon fONO, Son OitgOr CaWorme 92121 UW. 

S5:;srKS.Ch^°V™«tri^om; fax («»8) a,*^. 



lele-spedfic single base extension (SBE) reactions on 
standardized, generic high-density oligonucleotide 
probe arrays (Ghee et al. 1996; Shoemaker et ^-1996, 
Wang et I 1998; Lipshut! et al. 1999). In TAG-SIiE, 
the anay is independent of the specific markers geno- 
tvped and the assay can be oistomlzed for sets of marl^- 
ttltl^rough PCR and SBE primer selection. Because this 
genotyping method is generic, -t^^-^^ P"^'^ 1 
and favors multiplexed reactions, TAG-SBE is weii- 
suited for large-scale human genetic studies. 

To design the tag arrays, all possible 20 mers (4 
or -10'*) were subjected to a computational screen 
that favored a subset of sequences with similar GO con- 
tent and theimodynamic properties, and eUmuiatcd 
sequences with possible secondary structure or se- 
quence similarity to other tags (Shoemaker et al. 1996 
Giaever et al. 1999; Winzeler et al. 1999). A set of 
32 000 tags was select«d. with all tags expected to have 
similar hybridization characteristics and minimal 
cross-hybridixation under standard hybridization con- 
ditions. As a hybridiiation control and to enable back- 
cround and cross-hybridization subtraction, each tag 
probe (PM, perfect match) is paired with a second 
crobe that is identical in sequence except for a single 
base difference at the central position (MM. mis- 
match). The high-density tag array used in this study 
consists of over 64,000 distinct probes, over 32,000 PM 
tag probes, and over 32,000 adjacent MM probes, each 
probe occupying an area of 30 x 30 pm. 
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The TAG-SBE genotyping method pairs the exten- 
sion Dritner for each marker with a unique tag se^ 
qu^^^^^^^^ the deconvolutiou of -uluplex^^^ 
Reparations on a single high-den. ty P^ot^-^^y^^^ 
\\ The TAG-5BE approach can also \>e multipl<>xed 
1). The TAt^is^JPP (see below), 

both at the pirtmary PCR ana me ^du \* ... 
The resulting hybridization pattern from ^ typical 
S^SBE as5ay is shown in Hgure 2A, The intensities of 

the two fluorophores used are "^^f-^f^taS^ 
for background and spectral overlap. The quantitative 
hybridization results axe then used to make genotype 

calls (FiKutc ZB). 

we first tested wtiether SBE mettiods S^^^JT, 
ing could be simplified. Previously published SBi: 
methods 5ucb as minisequencing (Pastinen et al- 
1997 1998; Syvanen 1998) and genetic b>t analysis O^i- 
Sorov et ari994; Head et al. 1^97) r^uir^ *at 
double-Stranded templates iJe converted to single- 
stranded templates prior to the base extension reaction 
[although double-stranded templates have been suc- 
cessfully used ID fluorescence energy transfei-based 
SBE assays (Chen et al. 1997)]. We compared the TAG- 
SBE results obtained mth three SNP markers using 



both single-stianded and double-stranded PCR prod- 
ucts as templates, and found similar two-color mten- 
sity ratios and no significant differences in the absolute 
hybridization signal intensities. Thus, for subse- 
quent analyses, and the assays descnbed h^e, double- 
stranded PCR templates were used in the SBE reactions. 

To test the robustness, accuracy, and efficiency or 
the TAG^BE method, we developed genotyping assays 
for a subset of the 874 SNPs that were identlHed re- 
cently in a large-scale polymorphism screen of 75 hy- 
pertension candidate genes (Halushka et al. 1999). Of 
these we chose 1 71 SNPs in 68 genes, focusing on SNPs 
likely to have a functional significance: We chose SNPs 
in promoter regions, at splice junctions, and those that 
altered protein sequence. PCR primers were designed 
and tested individually for each of the 171 S^P- 
containing genomic regions. Of these, eight (4 7%) 
failed to amplify, and SBE primers were designed for 
the remaining 163 SNPs. We did not attempt to rescue 
the failed PCRs at this point, but this could be done if 
needed by respecting primers or through a modifica- 
tion of the standard PCR conditions. For six of the 163 
SNPs, SBE primers were designed for both the forward 
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c3 « -s CA^ Fluorescence imag^ of a small portion of an oligonucleotide tag 
Figure 2 [^A) Fiuore&cenctc '"'"s^ . <ftr rtftmera The entire array contains 
array following hybridization J^^^L^V^^^^^^^ MM probes for 

a single tag probe pair are labeled. (B)TTig J^^rescence in ^ »^ ^^ygote, and 
probe pair sKowing the presence of nf^ (,^^^^^^ P ^ 

plied prior to the ciuantitative genotypmg anaiysj. 



and reverse suands. Nine multiplex PCR and 
SBE reactions were designed with 9-28 inaxk- 
ers in each set. Of tHe 163 SNP markeis 
tested, 21 SNPs (12.9%) were further elimi- 
nated' because they consistently produced 
poor signails in multiple samples tested. 
These failures were systematic, and weie the 
result of poor amplification in the multiplex 
PCR or SBE reactions, or poor hybridization 
behavior on the array. It has been shown 
previously that roughly one out of 10 tag 
sequences do not hybridize sufficiently well 
on arrays of this type (Winzeler et al. 1999). 
Although these SNPs may be rescued by 
primer or protocol changes, repooling, using 
the opposite strand extension primer, or 
simply Unldng the primer to a different tag 
sequence (from which there are many to 
choose), we have not attempted further op- 
timization of these 29 (S+21) markers. The 
remaining 142 markers in 62 genei were 
used in subsequent genotyping expenments- 
The 142 SNPs used, the genes Involved and 
other details of the polymorphisms, and the 
designed primers are listed in a table located 
in the online supplement (note that the fir^t 
20 bases of the SBE primers listed in the table 
ate complementary to the tag probes on the 
array). Additional Information on these SNPs 
can be found indbSNP (httpT//v^^vw.ncbi-n- 
lm.nih.gov/SNPO or at http://genome.cwr- 
u.edu/candidates/snps.html3) (Halushka et 

al, 1999). 

To test the reproducibility of the TAG- 
SB£ assay, we performed the multiplex PCR, 
SBE reactions, and the array hybridization 
experiments in dupUcate for four indepen- 
dent samples. A high correlation between 
the hybridization signals of the replicate 
measurements (R^ ^ 0.92 for fluorescein sig^ 
nals and R^ = 0,93 for phycoerythrin signals) 
was observed for the 142 SNPs. More impor- 
tantly, there were no discrepancies in geno- 
typing calls between the duplicate measure- 
ments. . 

We next used tag arrays to obtam tne 
genotypes for all 142 SNPs In 44 unique DNA 
samples. Hybridization signals sufficiently 
above background were obtained for 96,5% 
(6029/6248) Of the 624S (142 x 44) possible 
calls , Based on the two-color signal intensity 
ratios, distinct genotype clusters were ob- 
tained for -S0% of the maxkers (Fig, 3). We 
used a combination of automatic software 
analysis and blind manual editing to assign 
genotypes for all 142 markers in the 44 



Genome Research 855 

wvvw.genorrie.org 



PAGE 21133 ' RCVD AT 3/412004 8:30:31 PM [Eastern Stand^d Timel * SVRiUSPTO-EFXRF-DO ' DNIS:8729306' CSID:401!7315392 * DURATION (inm-ss):14-00 



Mar-04-04 06:37pfn Froni-Af fynetr ix, Inc 



408 731 5392 



T-9S4 p. 022/033 F-222 



Fan al. 



3^ -1 



5 3.a 

e 



a 



CD 

2 1.5 



TC 



» w 

♦ 



CC 



0.0 



0.4 0.6 



0.8 



—I 
1.0 



— 3.0- 
"fl 

g 2.0 

D 



GG 



1.0 



0.0 



0.8 



1.0 



0.2 0,4 0*0 

RslM'tve Alteic Fraction (^) ' 

Figure 3 Ouster artalysis of «3^i^ay ^/'Tili'^S "sEtTzf Tlte 
44 individuals at SNP marker (A) APJPex3.33 and (© SELP.25.The 
Innarithm of total fluorescence intaisity KPM-M^/O.K-,-c.,J 
™?? 1 for each of the 44 hybrid «atlor»s is plotted 

Taains^theCTa^t^a r^atte Siele faction value P. Th. three 

d^n« c^%e« observed «n^?Pf «? n^^^^d ior 
and C/C for marker ANPex3.33, and T^^, T/G, and o/o ror 

marker SEI.P.25. 

samples. For five of the six SNPs that Had both forward 
aiul reve«e SBE primers, identical genotypes ware oh- 
teined from both strands in aU 44 individuals (i.e., 
complete concordance In P^r.d iests)^For one 
SNP (DCP1EX13.138), clear hybridization results were 
obtained for the forward primer, but the resists were 
inconclusive for the revene SBE primer and therefore 
calk for that strand were not made (i.e., one strand 



Yielded clear results while the other produced a no 
call"^ In no cases did the two strands give contradic- 
tory results. This experiment Indicates that either 
strand (or both) can be used for TAG-SBE analy^s of 
the maiority of the markcirs, and that for some marker, 
TneTtrLnd Ly be more informative 
described above, these assays were not fully op««^^ 
and we anticipate that it is possible to increase the 
overall genotyping yield further. 

TO determine the accuracy of the method, we used 
eel-based DNA sequencing to deteimine the genotypes 
of thiee individuals (a subset Of the 44 persons studied 
earlier) at 133 locL Comparison of the 355 paired gel- 
based and TAG-SBE genotype caUs showed a total of 1 7 
discrepancies involving seven different markers (see 
Table 1) a 4.8% discordance rate- Some of these dis- 
crepancies involved cases where one method made a 
homozygote call while the other method called a het- 
erozygote. But there were also cases in which the gel- 
bas^ sequencing and array-based genotyping yielded 
opposite homozygote genotype calU; we suspect sys- 
ternaUc misptiming of the SBE primer to adjacent simi- 
lar sequences as the likely cause of the discrepancy 
Designing an SBE assay using primers for the other 
5trarid may be sufficient to solve the problem in most 

'^^The quantitative nature of the two-color TAG-SBE 
measurements suggests the possibUlty of using pooled 
DNA samples to estimate allele frequencies and screen 
large numbers of lod for aUele frequency .diK^^^ces 
benveen groups of phenocypically distinct mdividuals 
(Shaw et al. 1998 for microsatelUte markers; Syvamen et 
al 1993: Hacia et al. 1998 for SNP markers). To test 
tliis, we first synthesized two artlfldal SBE templates 
and performed controUed mixing experiments. As 
shown in Figure 4, the intensity ratio of the two fluo- 
Tophores and the template concentration ratio arc 
hichly correlated over a 100-fold concenttanon range. 

further tested the TAG-SBE assay performance with 
pooled DNA samples. Genomic DMA from five, 10, and 
20 individuals with known genotypes was pooled and 
treated the same way as the DNA samples from mdi- 
viduals in all subsequent PGR amplificanon, SBE reac- 



■ ' ] ~Cel-ba»d sequencing Array based assay 

• 11 , 0048^9 90596 904957 9<M896 9 04957 Discrepancies 
SNP name m allele Mutant altele 904S89 905?6 
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tion, and chip hybridization ^^^P^ f Jj^^^SlJ^J 
observed allele frequencies were related ^i^^^'y ]° 
Se values expected based on the ^f^^ «f °^ 
tne individual, in P-^28^„S> nd reut^vej^^ 

r/de'^eTrot '^aS -Sr"^.^.rate^ 
be L^d to estimate allele frequencies in 90P^^^^^ 
and to scan large numbers of markers J^' 
frequency differences while greatly reducmg the num- 
Sof iS^vidual measurements required for assoca- 
tton studies designed to detect genetic ^»«««"«/ 
Seen groups of individuals with phenotypic differ- 
entia The minimum detectable allele-frequency 
d?ff"encL a^d the maximum number oi ^r^^^ 
that can be genotyped in parallel xemam to be deter- 

""'"Slr approach combines the parallelism and fl^ 
Ibility of a standardized high-density obgonudeot.de 
Sy readout vath the enhanced fidelity of 
primer extension reactions. O.ing a standard array of 
StTc tags eliminates the need to design and jnanu- 
Ke citom arrays for specific sets of markers « 
only the PGR and extension pnmcrs 
toi4sed. Furthermore, the tag-based approach uses as 
few as one or two oUgonudeotide probes per marker 
rather than the 56 probes used P'T^^^f^ '^'^ 7"^! 
detector aixays (VDAs) (Wang et al. 1998). The swn^ 
dard tag array could also be used l^^^^^^f^^^f^^ 
other genoiyping approaches mduding multiplex ou 
glnucfeotidVligltion assays (OLA) (O^lf unty et al^ 
1996- Tobe et al. 1996; Chen et al. 1998), invasive 
cleavage of oligonucleotide probe assays (Lyamichev et 
S 1999), and allele-spedfic PGR methods (Newton et 
al. 1989; Lo et al. 1991). 
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S[^r%^™^ran6 colccii^n, «^ multiple individuals ,t 

DMA from a group pf 5, 1 0, and 20 individuals (C-allde V^'^.'^*" 
deTof 0 10, 0.28, and 0.1 S, raspectlvely) was pooled jn equal 
amount and treated in the same way as the samples from single 
Sduair^ll'obsa^ed .llele fraction value P P|^«d »9a.nst 
the known C allele frequency, along with ths 'J^^J'JJ'"' " 
«,\irfp to the evs The line intercept* the Y-aws above the origin, 
!nd th^syiSic offset Is the result of a small amount of cros.- 
h^briSnand misincorporatlon of *e wrong base -n *e^- 
colof SBE reaction. A correction can be applied to the aata loi 
S^ng t^e -observation of pure 9,f ""^P" " "'"^^ 
curate estimate of the absolute allele freeiJencies- 

The experiments described here used only a small 
fraction of the 32,000 tags synthesized on the array 
and have not taken hill advantage of the multiplexins 
possibilities, our previous experience with developing 
highly discriminating sets of oligonucelodde probes 
for yeast gene expression measurements and genotyp- 
injr suggests that it should be possible to use a large 
frartion of the 32,000 tags on the array in a single 
exoeriment CWodicka et al. 1997; Winzelet et al. 199S). 
A fet of three Such arrays would allow the detenmna- 
tion of nearly 100,000 genotypes. The current array 
was synthesized using 30 )un feamxes on an 8 X 8 inm 
chip. A single, 12.8 X 12.8 mm atiay with 24 pm fea- 
rures could interrogate 128,000 SNPs at a tiirie. Phys - 
cally smaller arrays with fewer tags may also be usehil. 
scaling down the array size to 2 x 2 mm, an array con- 
taining 24 um features could encode over 3000 tags 
and acconunodate many important genotypmg appu- 
cations in which more marlceis may not be necessa^ 
In addition, multiple sets of tags can be assoaated with 
each locus-speciflc extension primer in separate reac- 
tions (pooled for hybridization). In this manner, a 
single array could be used to analyze the same loci 
from multiple individuals at once. 

The highly parallel nature of oligonucleotide ar- 
rays and their ability to interrogate complex mixmres 
of nucleic acids enables significant flexibility m the 
desien Of genotyping assays- Simple calculations sug- 
gest mat the cost of amplification and labeling reac- 
tions can be a significant barrier to the broad use of 
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large-seal* genotyping methods- The ^f^^^^^^'^ 
preparations demonstrated here permit sign^wxit rc- 
Sons in reagent use. "mus, multiplexmg bom spe- 
cmc genomic amplifications and SBE reactions reduced 
tS4 reactions needed for the X42 SNPs to only l& 
reactions. This 16-fold reduction can be ^^^^^ "^ 
pooling strategies. The current scheme uses two colors 
requires separate SBE reactions. The use ^four 
colors would allow a single-tube reaction, -with associ- 
ated increases in efficiency and reduction of genotyp- 
ing costs. 

METHODS 

Sample Colleaion and DMA Isolation 

DNA samples from 44 individuals were cOUect»d as part Of the 

mSrtitute Family Blood Pressvra f'Sr-"'- ™e ^^»;;'8 
scheme was designed to ascertain nudear fcioilies through a 
hy;^tonsTv , pilband. Samples were collectod under In- 
foimed consent and IRB approval at each Of two centos 
i^Tecumseh, MI and Maywood, IL- DNA was e«ra<rt«l from 
Stffy coats isolated from S to 10 ml whole blood j^ng a 
stai^rd saltins^at m«hod and th« PureGen^ »^ (f"''" 
Systems). For me pooling experiinonts, genomic DNA from 
fiVe, 10, and 20 individuals was pooled in equal amounts and 
treated like single DNA samples In subsequent PCR amplifi- 
cations, SBE reactions, and chip hybridizations. 

Primer Design 

For each SNP, primary PCR amplification primers w^ da- 

signed as described previously C^ang et al. 1998). Th« SBE 

primers w«e designed so that the 3' end t^«« 

before th, polymorphic site- The Prime. 3.0 ^^'fT^A" Twa 
www.genome.wi.mit.edu/cgi-bln/primer/pnmer3.cs^) was 

modified and used to pick SBE primeis at a predicted length of 
20 nucleotides (taage; l&^6) a.id melting ^^'"P'?™!' 
S7'C aange: S3'C-64-q. SBE primers weie picked ^^ ^^ 
fotwa d toection first (i.e., 5' w the SNP), "r^Tdfo^n 
Son being ^ed when a suitable prime* could not be chosen 
for the forward direction. 

Multiplex PCR 

Specific amplification Of the genomic regions contaimng the 
142 SNPs was achieved with nine 

each conuining SO r>g of human g«noimc DN^ 0.3 IjM °f 
each primer, 1 mM dooxynucleotide «'P?^<^P^**" ^'''^^'i 
10 mM Tris-HCl (pH 8.3), 50 mM KCl, 3 mM MsCU and 2 
units of AmpliTaq Gold (Perkln Elm«) In a »f 7?/"^^°^ " 
ul. PCR was petfotmed on a Thenno Cyder (MJ R«eatch) 
With initial dcnamration of the DNA template «jd Taq eo- 
zyn.e activation at 96°C for 10 min, followed by 40 ^1"^^ 
^»tu.aUon at 94"C for 50 sec, 5rC for 40 sec and 72 Cfor 
90 sec. T^iti final extension reaction waA at 72 C fot lU imn. 

SBE Template Prep^rarion 

One lU of Exonuclease I (10 U/pl, Amewham Life Science) »rid 
1 ul of Shrimp Alkaline Phosphatase (1 V/]x\ AmeishAm Lite 
science) were .ddcd to 25 pi PCR produc^ »nd incxib^ted at 
37"C foi a ht. The enzymes were Inactivated at lOCTC tcr is 
min The enzytnatic^Uy treated samples were appUed to an 



S^OO columri (PhannacU) to further "^^^./^^^^^^f^ J''^^ 
primers and ^iNTPs. The buffer was replaced with ddHjO. 

Multiplex SBE Reaaion 

SBE reactions were carried out in 33 pi reaction$ using 6 of 
the template (see above). 1-5 aM of each SBE pnm^r 2^5 Un^ 
of Thermo Sequenase (Ameisham), 52 mM Tns-Hp (pH 9.5), 

gland NucUaO, 7.5 P^ biotlTx-N6^^UrP or biotin.N6-d--CT 
or 3 75 uM biOtin-N6-d-<lATP, imd 10 pM of the other COld 
ddNTPs- Extension reactions were carried oat on a ™nno 
Cycler (MJ Rwearch) with 1 cyde at for 3 min. th^^n 45 
cvcics of 94'C tor 20 sec and SS^C for U sec. After SB£ r<:ac. 
tlons the products of the nine reactions from each sample 
were combined ^nd mix^d with ^ pi of 1 00 pg/mi glycogen 
(Boehringcr Mannheim), 1&.7S pi of 8 M UCl (Sigma), and 1.1 
ml of prechilled t-ZO'Q ethanol (200 proof), and precipi- 
tated ^ centrifugatlon (Eppendorf centrifuge 54150 for 15 
niin at room temperature; precipitated sampl^were dried at 
40*0 for 40 mln and fesuspcnded in 33 pi ddH-O- 

Tag Array Design and Hybridization 
For each tag sequence, two probes were synthesized on ilie 
array- one matches the designed-tag sequence exactly (?M 
orobe) and the other being identical except for a single base 
difference in the central portion (MM piobc). The misTnatch 
probe serves as an internal coiitrol for hybridization speafic- 
itv and enables effective subtraction of backsJ^o^nd and cross^ 
hybridization signaU. Over 32,000 20-mer tag P;°^« 
their mismatch partners were chosen (Shoemaker ct al. 1996) 
and fabricated on 8x8 mm arrays. Each probe (feature) ckcu- 
pies an area of 30 x 30 pm. which contains -10' copies of the 
chosen ZO^mer oligonucleotide- Sets of lOO aixays were syn- 
thesized together on a single glass wafer. 

The labeled SBE reaction products vrere denatuitd at 
gs-C-lOO^C for 10 mln and snap cooled on ice for 2-5 min. 
The tag array was prehybiidi^ed with 6x SSPE-T [Q-9 M N.O 
lo nSf NanlpO. 6mM EDTA(pH 7.4), 0,005%Triton X-lOO] 

and 0-5 mg/ml BSA fot a few "^i"^^^*' ^y^'^S?!^,^'!^ 
120 ul hybridization solution (shown below) at 42 C fot 2 hr 
on a rotisserie (at 40 RPM). The hybridization solution con- 
sisted of 3M TMACl (tctramethylammonium chloriae), 5U 
mM MES [Z-CN-morpholinoethanesulfonic 9cid) sodium saitj 
(pH 6.7), 0.01% of Triton X-100, 0.1 mg/ml of herring spertn 
DNA. 50 pM of fluoiescein-labeled control oUgo, 0.5 mg/ml of 
BSA (Sigma) and 29.4 pl-labeled SB£ products (see above) m a 

total value of 120 pi- ^ ^ . ... . _ 

After hybridization, the atiays were nnscd twice witn i x 
SSPe-T lot -10 sec at tootti temperature, then washed with I X 

SSPE-T for 15-20 minutes at 40^C V.^^^^P^T^t ^2^ on 
^ere washed 10 times with 6X SSFE-T at 22 C on 

a fluldlcs nation (FS400. Affymetrix) and then st^"^^ a 
room temperature with 120 pi staining soluhon [2,2 pg/mi 
streDtavidin R-phycoerythrin (Molecular Probes), and 0,5 ing/ 
ml acetylated BSA. in 6 x SSPETl and mixed on a rotlssenc for 
15 min at 40 RPM. After staining, the arrays were wajhedio 
times with 6x SSPET on the fluidics station at 22 C- The 
amivs were scanned on a confocal scanner (Affymetri^r) and 
fluorescence at 530 nm (fluorescein), and 560 ntn (phycociy- 
thrin) was collected with a spatial resolution of 60-70 pixels 
p,. feauire- GeneChip software (Affymetrix) was us<»d to con- 
vert image files into digitized files for further data analysis. 
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Genotype Determination 

For a given marker (at a given tag piobe P°"'^;°'^ 
,«certce intensity oE each of th* 

MM feoin that of the PM; negative values of PM-MM wert 

"eaters zero. BeCuse of the P^^^" f "^^^^n 
spectia of th» two nuorophor«. a fraction of the fl»f "^f^'" 
kLial (7.6%) was subtracted from the signal s«cn bi the phy- 
cSrin channel (Hacia et al. 1998). A metric P which ^- 
S« the relative amount of each allele W 'he target m-X- 
X was computed a5 th« relative proportion °« co^^rted 

jT^tensitiW [nuo«=^"l"/(^l'^°'««='lfrPT:^^"^IV™lues 
fine genotype clusters for each SNP (see Figure 3), the P values 

arseSatedwith each sample were sorted, and ransos corr^- 

sSnVto the thr.e SNP genotypes were computed usuigan 

a?goiithm based on empirical obseryauom '"O",^"'^ 8^^. 

typtas experiments. The purpose of this ^go'^*"* 'ff"* 

«fy\7ell-separated ranges of expeiini^ta 

spond to distinct genotypes. The speofac al^nthm employed 

h«e used the follovrtng mles: CD At m^t f°^^ 

CIS) about 10% of the total data may be excluded from the 

Computed ranges; (2) each pair of ranges must extend over an 

area of £0-3 and all three ranges must extend over (5) 

^dividual ranges must be separated by a gap of a O-l: C^> 

width of a single rang, may be =S0.4. A "S~<»"';« 

statistic computed as 1 - (sum of range 

range) - (number of Outliers / JO) was roaxnmzed for the set of 

ranges chosen. 

Quantitative Allele Analysis 

Two templates, template-T (S^TGCTGAATATrCAGATTCTO 
XAGTGCTACCTGAAAGATCCTG-30 and t^Pl^^^^ 

TGCTGAATATTCAGATTCTCOaGTGCTACCTGAAAGA i u- 

crG-3') were .syti the sized- They were identical f^^^^^^P^^^^ 
.jngie CZl.'*) position: T in tcmplate-T, and G m tenjP^^*^-^- 
Thitwo teiDplatesw^c mixed in the ratios of 1 nM/10 nM. 1 
nM/3 nM. 1 nM/1 nM, 3 nM/1 and 10 nWlnM. respec- 
tively. The ^^^^o^^^^nv^J'i^^^^ 

CTTTGCCGTCAGGCAGGATCTTTCA^^ 
5'-GGCGAAGTTCCTCTAGTGTTCAGGATCTTTCAG 

GTAGCACT.3', S'-GGCCTCGGTGTTCAGCATA^^^ 

aATCnTCAGGTAGCACT-J', ^'-^S^^^^^.^^^TrS^^^ 
tACCCAGGATCrrTCAGGTAGCACr-3', S'-TGCATTGATT- 

TAACTGCGCGCACGATCTTTCAGGTAGCACT-3 . 7.^^^ 
add^fd separately to fwe SBE leactlons containing *f JTiy^ 
types of mixed templates. The SBE primers wece extended m 
the presence of biotin-labeled ddATP and fluorescein-labeled 
ddCTP, pooled, and hybridized to a tag airay. 

Gel-based Automated DNA Sequencing 
To independently confirm the genotypes called using the 
TAC^SBE a"^y, three samples (954957, 904896, and 904889 
were sequenced for 115 SNPs from the t*blc m th« onUne 
supplement, u^ing conventional gel-based methods Samples 
were ampUfled for all sites with T7- and T3-tagged primers 
x^lng standard PGR cycling conditions [2,3 f ^0^^^ 
DNA. 0.375 jil of 20 uM primer (X2), 1.5 pi Of 10 x PCK 

U/Pl Taq DMA Polymerase (Sigma), in a total volume of 15 W 
Zl^^ d?H,0], Some products were sequenced direaly whi = 
other* requUed an M13 nesting strategy because of the close 
proximity of the polymorphic base and primer end. Samples 



from the initial amplification were diluted 1;50 with ddHtp 
^d amplified -^^^^^^^^^ 

I^gctatcaccItgI^^ 

3') primers uilng standard PGR conditions. All PGR V^o^^^l^ 
were cleaned wfth Exonuclease I (Amersham O.U pl of 10 
U/ul per well) and Shrimp AlkaUne Phosphatase (Ametsharn, 
0.30 pi of 1 U/m per well) in a volume ^^^^y^^^^^^ 
nator sequencing using an M13R primer (AACAGCTATGAC- 
CATG) or T7 prSner CTAATACGACTCACTATAGGGAGA) on 
an ABI377 (Perkin Elmer) using Big Dye (PftrUln El^")^?^ 
performed to determine the genotype stams for each SNP m 
each of the three Individuals. Trace files were read with Edit 
View 1 -O (Perfcin Elmer) software. 



ACKMOWLEDGMENTS 



We thank Drs. A. Weder and K Cooper for DNA sample col- 
lection, M. Mittmann and D. Shoemaker for tag selection and 
array design. D. Stern for constmction of atray scann„s used 
in this study, K- Bentley for DNA sequencing, and K. Gun- 
derson for helpful discissions. This work was supported 
by a giant from the Advanced Technology Prograrn of 
the NationiLl Institutes of Standards and Technology 
(7ONANB5H1031) to Affymetrlx, wid research funds from 
Case Western Reserve Univ^risity, University Hospitals of 
Cleveland, the National Heart, tung, and ^^^^ 
(HL54466), and the National Institute of Mental He^th 
avlH60007) to A,C- This research is a contribution Of GenNct, 
a network of the National Heart, Lung, and Blood Institute's 
Family Blood Pressure Program. ^ - 

The publication costs of this ariicle were defrayed m part 
by payment of page charg'is. This article must therefore be 
hereby marked "advertisement" in accordance with IS Ubc. 
se^^on 1734 solely to indicate this fact. 

REFERENCES 

Ciu^n, M., D. Aluhulci, J, Ireland, F. Sklar, K. ArdliC, N Fiitil, C,R. 

L&oe E.P Lim, N. Kalayanaraman, J. Nemcsh « al. 1399. 

Chstracceiization of single- midcotiUe polymorpftiims In coding 

regions of human gcncS, Macwre Genet. 22: 231-238, 
Ouikrivairl. A. 1999- Population ficnetics-maldng sense out of 

SfeOMencc. l^»(ure GcncL. 2li S6-60. 
Chee, M., R. Yang, e. Hiit.t>ell, A Bcmo, ^.e Hu«n^ f - J- 

Winkler, D J, Locthnrt, M-S. Morris, and S.P. Fodoi- 1996. 

Acccsjing genetic Lnfonnadon «ltli high-density DNA Orrays. 

ScfcrrcF 274: eiMl'f- , , , ^ 

Chen X andPY. Kwok 1997. Templatc^i recced dytstermlnator 

inL-f>™radon (TDD assay; a hprnogcneow DNA ^^E.^^'^^- 
method based on fluorescence rcsonanof cncrsy transfer, leucine 
ACid^ Rss. 25: 347-353. 
Chen, X., B. Zennbauei, A. Gniike. and P.Y. Kwok 1997. 

Fluorescence energy ti*t^Jf« detection 3i a homc^^ieous DNA 

Chen, X. KJ. Uvak, und P,Y. Kwok. 199*- A ^^^^^^^""1^.., 
Ujyase-mediated DNA diagnostic tesr. Oenomc R«s.^ 

CoUins, F.S.. M,S, Guycr. und Ch.kravartl. 1997. Vsrtatlom on a 
theme: CaUiloffinj human DNA sequence vaiuition- Science 

Collins, A- Patnn^s, ^Jordan, A^ Chaicravartl, 

L Walteis. 199a. New goals for the U^. Humaa Genome Project. 
1998-2003. Selena 252; 682-^89. , „ ^ xn i 

Pelohunty, C, W. Anl«:Aer, Q- DcrnS, J. Bng, anU D.A NlckerSOO. 
1996 Tescing the feasibility pf DNa rypins tor human 
Identification by PGR and oil oligonucleotide ligation assay. Am, 
;. ffwm. Cemrt 5»: 1239-1246. winrcW A. 

Glacver, Cr D.D. Shoemaker, T, W.Jones, H. Uang, £.A. Wlnzder, A. 



Genome Research 

\AAww.genome-Qrg 



859 



PAGE 25/33 * RCVD AT 3/412004 8:30:31 PM [Eastern Standard Time] * SVR:USPTO-EFXRF-1IO * DNIS:8729306 * CSID:4087315392 * DURATION |inm-ss):14-00 



r-04-04 06:38pfii From-Af fymetr ix, Inc 



408 731 5392 



T-964 p. 026/033 F-222 



Fan 



uenslty oUgDniiCleotUie arrays 3nd energy trtASter ayes, i^au^ 
Adds Fex. 26: 3afiS-3866 , „ . ^ * v/edcr 

Irl^iusriKa, m., j r-KsVx-^Marci 199S>. Patterns of smglc 

pressure p 

?c^ r^ta^n detection ia ibc p53 tumOr suppr^ox Rcne. NucIcnC 

Gtoce. and S.P. Bajiii. 1991. Single nuc eoUdc pnm« cM^on 
to detect Ec^etic diswseii oxperimental appUcation to 
S^opSiL B (factor and cysdc fibro^ gene*. Pr^. Natl. 
Acad. $ci . Sft: 1143-1147. m t .^kh«t 1999 

High dcisity synthetic oUfionucleoiid* ^y^^ ^'^t^ <^^^ 
Lo V M^^^'atd. Cft, Newton, A.F. M«klumi, KA. H^^mlriS. and 
/SMsfs^ficity. sensitivity and gcn^^tic applIc«iQns. 

taVova. R.W. Kvristkowski, T.J. Sander, M. de Axmda. D-A. AiCO 
« al 1999 Polymoiphism identification and quantitative 
detection of genomic DNA ^>7 invasive ^J^^^f 
oUgnnuclcotide probes, mturc Biotech. 17; 29M96. . 
M«rton CR A. Grahzizn. L.E- HcptinstaU. SJ. PowcU. C Summers, 

any point mut^Cion in DNA. Th. amplihcatit^ ^^STft 
mutation syrtem (ASMS), N.^c/rrt >l<:/rf5 RcS, 17: ^/>^516_ 

Andcrwn, QX. Trainor, MidM.R Knapp. 1994- G<=n«ic^^* 
Cnaly^s; a .oUd phase method Wi"? S^^^^^^^ 
polymorphisms. Nucicic Adds E^s. 22: 4167^175^ 

PasrincL T A Ku«, A. Metspalu. L. Peltoncn, and A.C. Syvancn. 

'^^1^. MW-cin«; A ^Tccific tool ^ ^^^/-^V;^^,^^,, 
diasnostlcs on oUgonUCleotide dffays. Gif^omi RcS. 7: 606^14. 

V^nJ^X P^'lton^n, and A.C. Syvanen. 1998. Arx.y-based 



multiplex analysis of candidate g*nes icveals two independent 
and additive genetic risk fectots for myocardial infarction in the 
Rnniih populi»5lon. Hum. Mot, Oj»et. 7; 14S5-1462 

Rij^h N and K Mefikangas. 1996. The future of genetic soidics of 
complex human dl5ftast:S. Science 275s 15 16-1 517. 

Shaw, S-H. MM. CartasqulUO. C. Kasliuk, E.G. ^'^"^"^"S^VJ'i^Mi 
ChakravartL 1996, AUde frequency distributions in p£>oJ«d DNA 
samples: applications to mapping complex disease genes. Gmom* 

Sho^aSr! D.D., X>J^ LaSWcait, D. Monls. M. Mirtaiann, and R^. 
Davi* 1996. Quantitative phenotypJc anolysii of yewt delc&on 
mutants using a highly paiaUel molecnlar bar-coding sixatc^. 
Nature GencL 14: 450-456- 
Syvanen. A.C. 1998. SoUd-pime ^^^''^^^^^ ^ ™' ^ 

DNA polymorphism. Methods Mot. &ioL 95: 291-29S. 
Svvancn. A.C-, K. Aalto-Sctala. L. HaiM, K- Koutula, and H J\. 

5odcrlund. 1990, Ptlma-gUlded nucleotide incor^xation assay 
in the genocyping of apoUpoprotcin £. C^nomics 8: 684-692. 
Syvancn. A-C.Tsa|antiJa. .nd lA. LuKk;^1995. IdentlfkaUOn Of 
individuals by analysis of biallelcliC DNA markers, using PCR and 
soUd-phasG minisequcncing- Am. J, tiwn. Gcwt?:. 52: 46-59. 
Tobe V.O., S.L TaylOi, and D.A. Nid^etson. 1996. Slnglc-vveU 
ccnoeyplng of diaUclic sequence variations by a two-color 
EUSA-bascd oligonucleotide ligation assay. Nucleic Adds Res. 

Wang, D.G., J-B. Fan, CJ- Slao. A. Bemo, P. Young, R;SapoW^, O, 
GhandOUr, N. PtMklns, E. Winchester, J. Spencer et aL 1998. 
Large-scale identification, mapping, and gcnotyplng of 
slnile-y^uck^tide polymorphisms in the hmnan genome. S<^ler«:E 

2S0: 1077-1OS2. * , ^ c 

Wlnzcler, £J^.. D.R. RlchaidS, A.R. Conway, A.L. fo^^f^' ^' 

Kalman, MJ. McCuUOugh, JM. McCuskm. D.A. StavcnS, L. 

Wodickk, D.J. Lockhart, and R.W. Davis. 199». Ol«ct allehc 

variation scunnitig of the yeast gi^ome. Science 281: 1 194.-: 197. 
Winzcler, ^A., D.D. Shoemaker, A. Astiomoff, H. Liang, K- 

Anderson, B. Andre, R. Bangham, R. Benito, J.D. &oekc, H. 

Bussey et at 1999, Functional charactenzatlon of the 5. ccrvv^stae 

genome by gene deletion *nd paraUel analysis. Science 

285: 901-906. . , _ . . . 

Wodlckn, L,, H. Dong, M. Mitttnann. M.K. Ho, and DJ- Lockhart. 

1997. Genome-Wide expression monitoring in Suecharomyeejt 

Ccrrvistae. NaL istotechnuiosy- 15: 1359-1367. 

RcCfHvfd January 5, 2000; accepted in revised /brrrt MoTch 29. 2000. 



660 Genome Research 

wvuW- ge nO iY\e .O r g 



26/33 • RCVD AT 3/412004 8:30:31 PM [Eastern Standvd Time] » SVR;USPTO-EFXRF-110 * DNIS:8729306 * CSID:4087315392 * DURATION (rnm-ss): 14-00 



Mar-04-04 06:39pm Frora-Af fymetr ix, Inc. 



408 731 5392 



T-964 P. 027/033 F-222 



article 




Quantitative phenotypic ar^'y^i?;^!,^ 
yeast deletion mutants using a highly 
parallel molecular bar-coding 
strategy 

cruel D. Shoeo«a»r.. A. UAfau:,". Do. MomA Mte Mit,™n„' & 

Ronald W. Davi5^ 

Apuantnative and highly parallel J^^^^^^^^ 

developed to aid in determining the b'o'og'Cfi wnCTon ^^^^ approach uses a PGR 
open reading frames (ORFs) Ss^^'^^':?"^^^. S'^n S ^ch deletion .train is 
targeting strategy to generate '^^^^"'"^^"^Jit^^^ ^ hybridization to a 

l^ellad with -,-nique 20-basatag -^^^^^^'J'^^^^^ identifiers (molecalar bar 
high-density oiigonuc eot.de ^I'^^'^^^^Zl^^an stains simuitaneously through 
codes) that allow analys.s of '^^^ J^f^^^^^ show that the arrays are specif.c. 
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The recent release of the SOmplcW nudcoUde "JUEnce 
of c*«vi«a« provides i^*^*";!'* 

identities of dl die Rcnes in this basic 
However, f«>^.:tional infonD»tion is ^^f^^ 
half of the estimated 6,000 f>-^f ^^"'''^^^^^^ 
loeical fimctioa of the thousands of jwes discovered by 
thTyeast Uad other) E-^ome sequencing P^J««« 
of the main challenges fkdng r^earchers at we enta the 
post-geoomc era, Detemming when a gene ut ' 
wherl the gene product is localized m the 
happens when the gene U disrupted are a few has^C 
«p4ne..ial q^c^ons that need to be addressed fo^e 

large number* of newly discovei^d genes. There ^ * dejr 
for new genome-wide approaches th»t Pr^c 
ans^vers to these question* in a rapid and cost elective 

'"^"disn.ptioc is a powerful tool for i^«n.min5 
the biological function of proteins <^'^]'y 
*eteriied ORES. Given the sequence of a yeast Oj"- « " 
possible to generate a precise nuU f ^^V^^"^^* f 
&om start wdon to stop codon)*-'. Information about 

Sie^iological function >'V''^^^XavS 
the fitness of the resulting deletion strain »«der a variety 

of selective growth conditions (for «»»Pl«- «P°'^! 
to UV Ught). This approach has beea 
tivc for investigating the fuacuon of Xf^^' J/*?^' 
tunately. the time and Ubour '-^°l-^'=XoS 
individiial deletion strains will mike it difficult to appiy 
this approach on a senome-wide lev<d. 

Stiatesies have been reported in which gene disrup- 
tions* aS selections are performed en ma^t W'^E r«i- 
doxBly integrated transposons*. These methods ofler * 
Smitic i^rease in the rate of analysing null mutaucas 



compared vith the traditionai single gene approadi. In 
addiSon, these novel approsAes provide valuable mfor- 
roation «.ncer™iS Z^^^ expressions gene product locel- 
ization* and the functional "^^^'^^^"^^ ^ 
However, using these technique, thousand* of 
rtiust be individually e»wimed foUowing ea^ selectt^ 
growth cotidition to determine which mutaiit, fe«ve 
olaved modified grovrth eharacterisucs. This labour- 
fS^^analysi. step will make it difficuh to test a Urge, 
number of selective conditions on a gcnome-wide level. 

Some o£ these difficulties have been overcome by a 
method in which transposons containing ^^^^^^^ 
are used to generate insertional mutants* The distm- 
EuishJnR t.M allow collections of mutants to be pooled 
tnd an^ysed in parallel through selective growth assays, 
^is approach has the advantage that all thr« steps 
(gene dteruption, selection and analysis) P«*»'?^=^ 
to paralld. Unfbrtimately, only 96 straias can be analysed 
S a single pool due to Umitatiotis of the filter-based 
bybridiStion assay*. In addition, die rardoin nature Of 
S^^Smposon-baked mutagenesis requires clomng arid 
DNA sequendi^ to Identify OM>s that are disrupted m 
strains with interestinB phenotypes. ,„_,d 
Here, we describe two novel approaches termed 
"quantitative phenotypic analysis" and tnolecu ai bar- 
coding." in which directed gene replacement iS used to 
gcflerie individual deletion (trains that are eadi labelled 
ItoTdbdnguishing molecular Ug (unique 20 b«e-pa.x 
sequence). -Digged strains are pooled and m 
nakuel through selectWe growth conditions. The level 
at which each strain survives a given eompentivt growth 
condition can be determined by hybridiaiug the t^to 
a high-density oligo'iucleotide array. This approach has 



nahjra genBtics vduniB 14 deeamtjot iM* 
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Coneantration CpW> 



four main advantages: i) Urge P^^"? °f "BB«d dde«on 
.trains (pote,itiaUy >6.000) «n t=.*"?^^Xdrft^ 
analysed because of The increased senSjtmty affojd=d 
{luo^cent detected*:! on high-density arrays; d^^^^ 
sUius have OBPs mm plataly deleted. "^8 f'"dua 
or altered funetions associated with trun^tcd P'od"";^- 
iiil deletion strains with inteit^sting phenotypcs are 
d ..? dentified without need fox further dcning or 
sMuencins and iv) the quantitative nature of ihe chip 
h^rSon it pcssibk to rev«l sub- 

tle ohenCtypcsAat way otherwise be missed. 
The «eps of molecular bar-coding are as foUowS: 




C30 bp] 



Downstream HftrrwIpQy 
(5Q bp) 




select YftBSt WfletinS s»qu«u:c* 
corner wg frorT list 




Flarklnfl PCR/Trana formation 



Hemofog^itM recombination 




t t 

CocTKpAi Primfr>g Site* 



i) individua: deletion stmins are S^"**'^ l^.*^^"^! 
spedfic OBf s with a selectable marker thatiS l^ted to a 
M base-pair molecular tag that serves as a unique iden- 
tifier, ii) large numbers of tagged ^el^^on s^ are 
pooled and grown competitively under vanous selearve 
conditions (for exampla, minimal medium, ecposure to 
X-rays, high salt, and so on); iii) ihe molecular ta^are 
amp ified from che surviving strains usmg a common 

:Slf primers and hybridised to a i^'S^'.-.^'^'^^S 
containins, at defused positions, knowa ahsotiudeoudes 
Sat are cLplemcntary to the t^g sequ^ices; and ^ 
iDWflsities of the hybridization *'e?»^ f^'.^/^^f 
array ate measured and used to determine the relauve 
abundance of the corresponding deletion strains m Oie 
pool- Measuring the relative abundance °f ^ " 
difierent times during sdecuve growth allows the fitaess 
of each deletion strsun in the pool to be quantit^twely 
determined. 

^^c mcSar tags are 2G base-pair 
specifically designed to serve as unique identifiers. Tag 
s^quSices art as different as possible yet Stdl letam sim- 
aar W>ridi2ation properties to facilitate simultaneous 
a^alyL on high-d/nsity oHgonudeotide ^f^;^^^ 
an J^rithm to select a set of 9.105 "^^^^^fj^ 
guished 20m«r tag sequences that art predicted to have 
ftoilar melting t^peratures (61 ± 5 *C). no secondly 
Srtc^ and no extensive similarity between any two 
tl^u^s in the list (>S misn^atches). This set of tags ;s 
sufficient for the entire yeast genome. ^ 

no 2 PCR-taiB»U09 «rat99/ u»«l to ^snerala '"OSgd daiatlon 

'pS'S'i^l^iSU* that '«<*^'«^,y^}:^s^J^^^- 

Zi liomoloay to ^'^^^^^'^^Jb^iLtsTyaaM homology 
cIMtldK is a samer thai COJ^^'^'^^^VSw olta«wdeottiJe 
and 18l>»aaa of nomology to ttio msrtwr, TTm omm i« 

(see M«hod6). A s»Cord "^nd of PCH te ^^^^^^^^^ ,o 
that are homologous o »"5,»J^I\ti produS from tho 
*5S round PCP is transforrned into a heplflldye^JJ '-J 
So^3 racomblnation remits trt tha ^P^'^^ltT 
gaSd ORF with tr>e ma/Her, 20mer la^s ^f^d primina site. 
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. — - — centration was detertsd in the pres- 

High-dens^oligonucteotide -ftbtel f>^"f crcs of tMscompUx hybridisation 

Hvbridizaiion to a bieh-denjity p oRF ;fei3«lng hybridization intewities for tht 

XnSdearriVcandettctthc ^ efficency tiJr..ed oUgonud<»tWc decreased m 

presence of sfluoiesctntlylabeUed ^^,2QBi soSa. ^%°^<.^> » ""'^ *^ m It 

~» we used a combination of phc- ^lajM59824) S7ias 50% 4^ centration range tested tl aM to 15 

synthesis chemistry to generate ^'i^^ l^gaa l009ti8/8) 

aVravs containing thousands Of ADE5^433^j 75% (Bffl) Oonsrating tagoed deletion 

oligonucleotides complementary to ^o>' (M2*Si7) 25Ma I00^t^> strains 

Sbe differemlOmertassequences^'". 7Bpz^coi3S8) 607^ 10°^ «^ We us«d a variation of a pievio<^ y 

^e. element on the an.y measures THggtO^^ '^^^^ ^'^."T'^f d^^^^on 

130 X 130 urn and contains approxi- TWP^y&^73) ^^^^ sflM (7/8) to generate the tagged deletton 

ioately 10» oligonucleotides of i ' G^„g,-H aeeeaalon strains (Fig- 2). Normany. a selecmble 

5v^n sequence'. Thelocation and ^„«„O^S,^p"^'"^t|".^rSn-'*n- ^erker {such as k»n') is ainpUficd 
identity of each sequence on the two- ™ iptaBfat'ons wsus t°|>0 a' e using a pair of long pnmers ttiat con- 

Smei«ional gjass surface is known. SUwn for »eh ot me n ai.i«mopni= ^^n yeast targedng sequences on the 
F uorescently labelled DNA is ORFs- 5' end and homology to the marker 

aDDlied to the array Mid scanning the 3' end. In our modified scheme, a 20-base tag and 

^SSfnfa miCTOScopy detects specific hybridization on ^ «^ m o incorporated mto one of 

i sTnS^-t (50 ^l^'V'^reL: "a '£uc*lKe ' PCR-amplified selectable 

Snute s^n) generate quantitative hybndi^non '';«°°^^fonned into a haploid yeast str«n and 

n^mation for thousands f "omolVgo^ "ecombinaticn results in the replacement 

TO determine the optimal length fer die tag sequences, ^ qRT Cstart codon to stop codon) with 

JjS«d and t^d ^^-^'f^^':--''i'!'^T.l 1 lrS^° r tag, and tag priming site. The intc- 

thousands of specifically f ^Sned Jrn-^IS^^^ the .^^^^ ^ fo, 

20meis (1 .025. 8.OOO, «»d 4 500 °l'£*";',^f,^"dSn Aat ddedon strain, 
rivehr) The specificity and intensity of the hybridization 

0," »nsv uid to . -Pm TAGS' S;!'"™:,^*, ,'' =««t.'>. Tibl. 1 "hov.. it i.«««>i»»oo» 

^Ugonudeotides that are complementary to ^pecific tag ^? "^'^^^^^hic deletion strains described m Table 
sequences on the array. These fluores^ntly labelled l^-j* ^„ generated by combinmg equal 

20mers should generate a 10 x l\dleck«^.bO«d ^- ^ Jdet.° «geed deletion, 

hybridizati^i pattern in the upper left-hand "rner of pi.,;^ atracted from yeast cells m the 

the array. Thfpredicted hybridisation P?"er^ was =t^^'^<l^„^f^^^^ ^e template for a tag amplin- 
Obtained'^with virtually no cross-hyhndizanoti out >de ^el«^OD P» .^^^^ amplification step serves to 
the 120 targeted array elements CFig. 1 b)- 1" this "^;°"J ^ eonccatration of the tags and reduce the 
hy^r dization «cperiment. the signal itjt^sity for some ^^^^^^^^^^^^^^ ^^pi, hybridized to the ^-H^f- 
of the tags U higher than others (Fig. 1 W- These highly f^^^^y/aed tags versus total yeast genormc DKA). 
° proSle differences reflect either the ^« Ip'^H tags were hybridized to the K^^ensi- 

dependence of the hybridization reaction and/or vana /^j^-^ ^ then washed and scanned. s^g- 

doaTn the quaUty of the 120 synthetic. labeBed oltgonu- W^^f,^^ ^„ ^tense and highly specific (Jig. 
deotides. . , ^rOnlv one out of the 131 tags tested 50 far has dis- 

To determine whether the hybridization sigi^ the ^^^'^ i^d of cross-hybridization (fig- 3. 

« o^,-<irfltrlv reflect the cOrtCcntrananS Oi Tags m a y**/ 

a tltrat^o. .«P-;---mg ye^w -ens'J p .ae a good 

the pool of 120 labelled oligonucleotides Tl.e concen^ , Jtem for testing the bar-coding strategy because the 
Ution of one oligonucleotide varied ma^n« of ^l^^mto;, ^ ^ ^ ^omp^tive growA e^«- 

«ven hybridization experm.ents "^^j' j^^^ r^latiw abundanre of these straws should be 

_» y n n rtfl TiM. 30 oM. and 1 5 pM J , While the imem, j i, culrivatEd m cOIB- 



. „ cultivated in com- 
plete medium because uic a^* — are predicted to have 
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'V'^ ^ Deletion Pool 




Genomic DNA 




PCR (Aorrtimatrk!) 



Pgpulatlon of 1 1 dIfTorant taos 



\ 



MM 





4 






. . t 1 


L ■ r 




Oalstton pool Mntauiing 11 ta$9«d deievon &^nsw*.^^^ 
template fee ma ta© ampTrficawon rBactlon. The ^ ^SSm^ 

(he oxaet location of ttiB 11 taos on *a afW Tns »ton-K mai 
eatSB B creiss-hytirltflzinn soquanoa (see taxtj. 



patterns were obtained at each of the orne points when 
Lgs were isokted from the del.tioa pool grown m 
nl^ medium CFig. 4o). In contrast, the sigiwl mtmsi- 

^rfA. aad deletion strains l^'J^.T^f I 

weaker when the pool w.s grown in rnecUu«i laclung 
adenine (Fig- 4i>). The expected depletion ^^"^ 
deletion strains {trpll^ trpi^ trp^ "T^t'^^^v^l 
occurred when the pool wa* grown m medium lading 
ttvpiophan (Fig. Figuit id shows a quanmauvercp- 
^e^tation of the hybridization d.ta from the growth 
study ift medium missing adenine. Wis measured the af- 
nal intensities of the adelAand trpSa deletion straps 
at variow tiine points during the growth study ^d ^t- 
tedrhe noTtnalizcd hybridization data (Fig. 4<i). The 
graph clearly shows that the relauye abundance of the 
n-oSA deletion strain was maintained through the com- 
oetitive growth while the orfelA deletion stram w»* 
depleted from the population (see Methods; data analy- 

si$)- 

Colony colour assay i,^i,„ 
We used an independent approach to determine whedi« 
the sisnal intensities of the tags on the air^y accurately 
reflect *e relative abundance of diSferent deletion strains 
in the pool. The fl<iel A and adeli. deletion strains foittJ 
red colonies when plat=d on medium containing hmit- 
amounts of adenine while the Other nine deletion 
Sform white colonies on this mediom. Determin- 
ing the percentage of ted colonies provides a ^f^f^' 
^rementof the relative abundance of the «delA and 
adelC^ deletion strains in the pool. We cultivated repre- 
sentative aliquots Of the deletion pool in complete medi- 
um and medium lacking adenine, and at ""^^ "f^"' 
indicated time points, plated several thousand cells from 
Se deletion pool. As predicted. 18% of the colonies 
Uk=o from L starting population (2/11 de et.on 
strains) were red. This percentage of red colonies 
Temaiaed constant when the deletion poo was grown 
m complete medium (SDC) for 10 P°f 
blings. in contrast, the percentage of red <|oloni« 
decreased when the deletion pool was grown m mem- 
^ liking adenine (Hg. 4e). Thefimi^riy b.t^ idie 
plots shovm in Figs id and 4e verifies that the hyb"d«a- 
tion signals on the array reflect the actual abundance of 
the tagged deletion strains in the pool. 



similar growth rates. On the Jiand. the 

adriA-«le5A ddetion strains or the ^ 
r,ro2A deletion Sti-ainS should be depleted fromie p^- 
ulation when the pool is ircr>m m naedia lacking ade- 
nine or tryptophan, respectively. To Z^"?!"^"'*^, 
we cultivated representadve aUquots of *. ^detiw P°°« 
in complete synthetic medium CSDQ. medmni actang 
adenini (SDCADE). or medium laclung tryptophan 
(SDC-TRP). At various time points after laoculation, 
azs were isolated from the pool and anal-;^ed onthe 
ai«y to determine the relative abundance of the dife- 
ent Lution Strain, in the pool. Simdar hybrid.i^uon 



Petermining the biological function for many of the 
newly identified yeast ORPs wUl require that thousands 
of different deletion strains be test=d undo- a large van- 
etv of selection conditions^. Molecular bar-codmg ^s 

ideal for this taskbecause it'lio^i^VB* """^f,^* ?n 
tagged deletion strains to be analysed smultaneowsly in 

a hi^ly quantitative ^'^^^^''^^''^^^'''TTaZ 
(genome-Wide) step of making the 

ion strain* wOl only have to bo P^^j^^f 
which aU subse<iuent selections and ^a^V^" 
performed in paraUel. Furthermore, the quant.tanve 

Ltare of thU approach may -^^^ll^^^^^^^;. 
subtle growth d fferenccs chat might otherwise be over 
ooked^For example, the relative abundance of » srn^e 
deletion Strain with a 5% g^ defect (doubUng time 
of 147 minutes versus 140 minut=s). grown m a com- 
petitive environment with 5.999 other d^etion strains 
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wUh wild-type gfovrth rates (doubling tim« of 140 iTiin-- 
uf^l wo2dbe rcd^icedby 30% after 10 po?nlaH<^ii 
Sli^jrlTnd by ever 85% after 60 P^PuUnon do- 
blinw. T^Js dramatic change in the "l?**'"^^"™?!^' 

on asspy. Fin»Uy. the coU*ction of "»drv«itxal d^et.^" 

str^n, 6en=ta«d by this project -°;^l^"™'^^'*Su 
available to: 1) confinn ph«notyp« observed in *e i«J0U 
U) make double mutants; and m) cross into different 

w^^qulre a high-througbp^it strategy to Ef 
^ou«nds Of ta^ed delerion strains Tow^ds i^^^^^^ 

a robust PGR deletion rtrategT has bc«n de^"^'^'^'^ 

de^doped to make the thousands of reqwed oUgonu 
deotides". ThU itiadiine has been designed to rn.m- 
mile the kbou* and reagent consumption evolved m 
S gonudcotide synthesis. Over 20,000 oliS--^^"^^ 
have been synthejiied on a single machine in one year 
^Tco'" n2 cents per b«e. Th. 42.000 oUgonu- 
deotides that vrfll be required to delete ^\ of the ORF* 
Si ;he yeast genome Cone 86r«er. one 68mer and five 
"smerrL eLh of the 6.000 YC-" Ol^^co u^db ^ ^- 
thesized in less than ayear usxngthe three S'6->'«" 
thesizers currently in operation in our labouratory. 

* D^loping a sl^tem capable o^-'lvf f ^^""2 
o£ taeged deletion strains is also esseonal to scaling Ais 
5p?oach to the geaome-mde Level. The f'^'f;' ^" 
tfist b. addressed axe Tag amplification, «iay ^6^51^. 
«iisitivity of the d«tertl6n system and "oss-hybrimza- 
tioftThe problem of biasing the relative levels of *e dif- 
fertnt tags during the amplification «cp has b..n 



AOey and c, '^"T^?^"? ttVPtagJ«2 SS^SAdeUen straws 
1^ tea f^l^flJ^SJ^r^^ ». A plot Show 

pool *™™ relBtive Bbundanoo tho atfaiA ana 

th^SSffiSf i^a* titie ^oini «ew rK«Tnaltt«l to tha 
uet obsafv ed In the gtBnln9 pool- 

minimized by using a wnimon psir P^^^" 
die fict that aU of the tags are the same lengtiu 
Sir base composition and are free of secondary sttuc- 
'^e The existing light-directed synthcs^ '^^^^'if^ 
can generate high-de^S^V an^y. conta.mr^g 65,000 
Afferent oligonudeotidcsi'. v,hich is wcU beyond th^ 
r^uitements of this project. Furthermore, the current 
flXeISm detection system can readily nieasun: con- 
fluorescent « 7 which should be sufficient fbr 

rtionitoring pools comainmg more «f^0 f 
st„jns (see Methods; pool Si«). FinaUy, initial 
Sdlzadon results with the J31 tags sugge^ that ^OSS- 
hybridization is minimal and largely avoidable. of 
S^e tags ^ be tested for cross-hybridization before use 

in the ddetion luategy. . ..^ 

The goal of this proje^ is to provide expenm«^ da^ 

for thousand* of deletion Strain* m a rapid and cost 
Iffea^ manner. These data. 'I-S wi^^« -^^^ 
of individual deletion strams, will be a vduable resource 
for investigating the function of genes in yeajt. 
^ SoLl applications of the mcle^r bar-<:od.ng 
strategy include th* generation of a complete set of 
taggeddeletions in a 'true' wild-type yeast strain, one 
^Ifh-sbeenrecently^olatedfi^r-.'^l'^^^ 
niche The coUection of ddetion strains could be relcasea 
bacL ^to their natural environment and the popu^tion 
mS«d over time by surveying Ae PCR-ampUf^bk 
MM This type of analysis would be useful for deter- 
XlnR the activity of genes as a function of the envi- 
ronmLt^Mole^lar^ar^coding m^ 
invaluable tool iu this emcigmg fiddof molecular ecoi 



molecular tags will also fedUtate the task of kecp- 
inettiS oftbe thowands of ddetion strains- Normally. 

SSddi^io. strains "\^«^-''^^'=^lA^ttib^; 
codes) that are placed on the outside of the tubes or 
^t^^ platef. incorrect labeUlng of tubes or pumng 
Te" rongs^ain in the wrong tub. »^ f«'°« P-^fieT^ 
that can be minimized by having a unique ^^f^f^^ «» 
^e actual deletion strains themselves The ^dent^ of 
any Strain could be confirmed by perfomnns » ^""ple 
tae-spedfic PGR or by analysing the tag on an ^^Yj^' 
Kore, the fe« that the tags are phy^'cally linked 
the mutations in the deletion strains may make it pos^ 
siblc to perform recombination experiments on a 
genome-wide leveL The Hnked natuit: of the 
ir^s may also allow the entire collection of deletion 
stcaim to be mated inic different genetic back^ounds. 
However, many appUcations only require that the mol- 
ecular BE be in the same cefl as the mutation. Por otam- 
ple tags could be introduced on a plasmid or into a 
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ntimer «.d 1.5 U veni polymm« (New Eng5»n<l 

lection of muams. j.^,^,^ „ ^ , d,.„ 25 cydes of 94 -C f?J 5°^^^ 

The molecul»r bar-coding s^^^ify " ""L . ti: is s 72 *C for 30 $. foUovwd by a 3 mm Extension »t 72 C. The 

iniiyib^iA^lf^^oriSa^n^inye^^il^rhii^p?^^^^'-^ id iurifcd PCR pra^ua wis cloned into Smd hn«r,zcd 

aS to any ask. in vifro or i« that r«<i^ r« Iwep «« J^^^^ cLits««..) and trtnrfonnrf bto decir^ 

nln^tioas (ccUs. DNA fr»gBi<:at», mol«<rales, and s6 Pf'^^^nys, (GibaxBRU.vmUc colon.*. w«« 

Ur« (for «aDipl<:. s^or^dwy ssmcture. . unique lOmer t»6 1 Bmer «nuncn taj 

SMu""« combining common 9«>crs (or m«r.) were chrm- 0=* ^ tr^^poson Tn903 w« u.*d a* the dommBnt 

o«i^rdLtinE thi» SKP eontigviou. stretches of homol- ™™ , mwket". Expr«si«n of di« (rarf gene coafers r*jli- 

not A<rwrl. The output of this ptOgmoKPc WW i list ot w. ^ ^ ^ pj^^ polymeraiB, 10 mM TVm CpH 8.5), 

the r«ultirig set 6f 51,082 JQm«J P«»cd *"«B»' » j ^19* "C theo 30 cydBof IS sat94«C IS » atSO C 

sS™d filter in whkh a pair-wi« analyiis ""l^ »rtd 30 S »C 72 -C folbwcd by a 3 mln e««B.on « 72 C D« 

»^ diminate W «queneet most Uk«!y tt. J^ii^^bi oUBon«eleotid.«y«th™.eh*niistry. theproport^n 

P«-fbrmin8 th« P>it-«1« m™^ of S -tagth Wrs in «. unpui-ified ,»ti.pk I, low- PCR 

ffiolds 8«-=»<'<' ^^'l' ^"ir^S,*^*'; « 5tlm.« Pr.d»a, s^eratrf with primei. «f this lenjth xesuU i« low 

were obtained » the sttiflg^ncy level wa. ■"y™*"^ ;„ i,^ormitioT» on the ends Iflg. 2: bhie f-'^^f^^'. 
Tnclsed. The 4.500 ZOmer. d„crib.d her. were scl«red frotn '^^S .^^^^^^ ^^^^^ 

the set containing 9.105 20«.t^ , !'"P^«'«'V^^™,!?Mn«X^produJ^^ 

Hybridization to the high-denritr t^i^^"-^ > '""^^^^ ^^'^1^-^.1^- 

Slorcssently labelled ollgonudeotida. complemenaiy to IM 5^ a,* end. »f the Gm PCR J^' 

d^Ut tag saquenees. were synlhe»iied aang an >^ M;O S- 96 »r » pcR was P«fc'^«'.'«'°8 ' ,*! 

w«ll oHaonuClMtide synHiesixsr' r>e1ds wert q..ant.6cd by on* « ^ ^ , ^, lo-bwe piflners lh*t art 

reVu irib^^b«.ce^i«n™-'i* hoXo- =f ^'f* ini^in^^^^are 
usS without poriEcation in hybridization aswys. ^Sf (L.2). The amplification conditions for the ^^^^^^ 

S .bridi«.tion mixtura < Sx SSPE-T) coWimei ^ M NiCU «> ^ condition, used in the fiisi round. The PCS 

SMNaH,POH, 5«.M EDTA««1 0.003*TritonX-100^^d^- >d°^«« ^ ^ grf used w.thoat 

tion. th. aolvltion, COnUined ''^^(^^c" sT^^^^^ purifiatioa in the subsequent tr^nifonnanon. 

L1^■;a"Tf'S;e^«''^^^ Tr.nsftrn.atiot«.Vea«srow.hmedj.««.dthesjnd„d«J^^ 

20 min hybridization at 37 'C. the array was waslied 10 nm« for^ipul„insy*«th.ve b«n d«f o^'''^ J- ^^/P 

with IX SSPE-T at 22 >C. The wash tolution was ttmoved and ^,,^,2^ ^^^j-^ sua CUPl CJ ). isosfn'^ 

I- hvSrid^ttion thamber was re-filled With 6x SSPE-T ptior "J ° ^^jgc. w«> used for the tr,nsfbrm»tjon» (provided by 

o Hyb*d^"tion events wer. detected by using an ;^'^*J^,Uer). Cells (1 x 10<) were tr^isformed wi£h 

a«^ iol fa.." .0 otdte the fluoiocei™ the reiuhing emi«|0n I „g „f , v,r^ pcr product ^^^^ 

de«aed Oiins a photomuhiplier tube through a 530 niti aPP method", -ttansfetmed cdls were 5^ « 

^l^t^a,Tmo\iii\u Dynanii«). Th* '"t*** ""V -r in YPD for 4 h and plated on YPO plae» eontaifl'ps 

3t "r«o"tL"tf^ l.« foo ™^1^13 (Gi-bco 3RL). Lar« C418.r«^^t «lom« 

o^iitiative siwial for «ch cf the 4300 tag sequences. The col- 5 j'Jj ^y^tini after 3 d erowth at 30 'C and tiie tacK 

?^d d'a^vas analysed using GcneOiip software tow. Colonies (»Krom «aA tr«isfcr^^^ 

Construction ofthepFA-TACpWid. A PCRcksnmg st^^^ Sonies^re Aecked by replica pbting onto the^pt^P^^^ 

BT was used to introduce an 18-bp tag Pining «h* drt,p-out medium for each of the ll different »™P^ 

multiple cloning region of th. pFA6.)anMX2 -^l-*- J^CdjB not SliOwn). The conficmed delenoa «~?» 

PGR pTOduti w,. Ba'crated using the pPA6-kanWX2 mod- f^^^^^^ ^1 « .70 -C. Th* 1 1 au«.trophu: delenon 

l^e » tempUte and I pair of prirpers complementary^ the sj^^.n^^^y^_^ ^^^^ ^ ^^^^ ^ ^^^^^^ ^^^^^ 

r«8ioDs flanging the marker (underlined sequence). ^ ne ^ ^ ^^^^ 
pruning »ite is i-eluded « an IBW estension on the 5 etvl 
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1 e ..... ml liruid YPD cul- army wcrt msasured- After suturactirtg the bAckgroufl^, tl« 
Gen«±tiiis lb* ddetioa p*ot Scpa^ltc 20-^ bqutd YPD cui ^ normalized lo ihe «gitoi from ch« reference strain 

tagged deletion strains- The culturts wcrr grown .i J 30 C unm " ^fraction of initial' values were obtained by 

nll-ex^nential ph.s. (O.S-l.O OD ). ^ ^^^^^^^^^j^^^J ^vWHe^-nAli.«l v^lue. frooi «ch time pouit by the 

ccU$ were w^nbiiied to g«ieratr it P°?lc T^Jzcd valu^ fr^rn the iniiiat time point A x«dc of one 

was stored In lO-OD^o ^^^^ ^^>^"^^ "^^ ^' indicates that the relative abundance of that deletion strain has 

.. , ^ nnf Chanced in the popvUrion. A ratio less than one indicates 

sroplificMion Kartion «as performed on \»J '^B"'"':^."?!'^ la the plot shown in figure 4d (ADE minus meiiam), the 
Uobtrf from a mid-«ponemiil ph^ culture of ^h=f delct^n «raia was «=d »s the "=f"en« StrJin. Similar 
pool >s previously descibed". 0« of the oommon ampH- 2^*'^''=''°^; ^^^'^^ „f „th=r nott-AOE ddetion 

included in lO-fold «C«s ^^'^'^^^.f^^/^J^^'J^o « .^refe^rrL sn-ain. to the fiitut^ several difltrent »T.trol 
^c^trie PCR. The IDO-mJ reacnon ""^^^ ^^"p'f^i" 't^ns wfll bt constructed .nd ipduded in the ddet^on pool. 

„,o„ 1 ^^'-^■^'^'^^PI^I^^^^^-) id genome th« ihould not affc« the fifiew of ,he cell (in pseudo 
S.^^d1.T.nd^!5^^S^?>^ |:n«.d.p,ic3teds»e.inter^nic«^o..^d»cn. 

Jntly hWled SSmer tag amplieons veie ^^y.^y^^f"^'^^ ^^ot^Z w"e ^ fem *e pool and pkted on rieh 

the «m without purLCca«i0.i. The 2D!H*l hybnd>»ation '^^^^^^.^^p^ ,^,h»dc,.iol The nunj.er 

mre corLne± 6x SSP^T. nM ^'"^""f "^^^ f^f StS" "deli ^ deletion str^ns) .md white 

50 m Of the t»8 .mpUficrion t^»ctin« [-25 nM °f ^Ini^^l^ nine deUtion «t«Lins) was determined for each 

Tags; 2.2 nN£ e»eh). The hybridis-tion, wwhinj and scannuie ^he percentage of ted colonies v™» detcr- 

condiiionl wen at described sbave. ^^^^ ^ ^.^^ p^lj^j^ j,^^ theindueS were nominliMd to the 

^ . ,- . .V. «aol oereentase of red cnlonies that vittt obicrwd in the aarting 

e:lnXre1,C„^h:'^o»X^(^=^^^^^^^^^^ Pa.lS«.TH.pro«ssw.repeat«.in.edium»iss- 

„d alLquois containing J ^ 10' celU were han^sted from the >ng adenine. 

culture at O. 3. 6. 9. 12 and 2J h for subsequent andT^'S 0° „nc=ntiatioo of a tag *n » hybridization minute 

hiKh-dcnsity array. The doubUng tm>e f'' P»P"'"'^^^^ fs't^ly P^P«ti<.»al to the total number of ags in the pool 

approximately UO min in SDC medium. To ensure that ^e ]l^^^%^Jf^^^ , p,^ «ntUning 6.0OO strain, v-ould 
cl^I, were harvested f™«mid-lojt phase '^^l*' J^Un ani^iv^^^tag«-=n=«tioa of 85 pMdOOpmole. 

toe points, a series of 2-fo d diiutJons of d^^'^" 'tTbe g^erat^d by a single tag ampUfication reac.ion/ « ,000 

were made at the beginning of the «xp"ir.™t. ^™ ^ £ ^^^^^^^^^^^^^^^^^j^.^^ 

^^rl'ifl'^ ?\ jl™O^TxTo"'?7x1tJi «d2J X 1^^ of 200 iil). This concentration is weU within th. detection Um- 

dls^^^regiUed Ly':al2ng 2-fold serial dilutions with SDC i« of ,1,. iluoresc^t scanner, 
^rfium. At each ti^e point. -3x10' ceDs were harvcMed 

from a culture that waa dostst to mid-log phase l&«w«n AoknowlodgcmBnt* 

05-1.0 Ot)/nil). This procedure was repeated using SDC-ADt ^^^j, j. s. Foi»r, B. Hubbk, £ Genwien. D. 

and SDC-TRP medium. toekhan. R. Sapobki, and Che^fram Affymttm E. Wrraafrtf, 

Data analysis, The growth tares of die tagged deletion^ini m /&hards, and Q. Cavttfor hdpful discusritfn/ and <• er<ira,l 
the pool are measured relative to a reference sti^m. The rettf ^^j„„ gf^f^ rnauuscripi. 

,nce strain has. tag integrated intoaicgion of — — ^ 
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